Joint Loop End Modeling Improves Covariance Model Based Non-coding RNA Gene Search
Abstract
This paper examines how more detailed modeling of the stem-loop interface in non-coding RNA hairpin structures affects the efficacy of covariance-model-based non-coding RNA gene search. Current covariance models treat the prior probabilities of the two stem nucleotides and the two loop-end nucleotides at the interface the same as those of any other stem and loop nucleotides, respectively. Laboratory thermodynamic studies show that hairpin stability depends on the identities of these four nucleotides, but current covariance models do not take this into account. Estimating emission priors separately for these nucleotides, and treating the substitution probabilities of the two loop-end nucleotides jointly, leads to improved non-coding RNA gene search.
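The difference between joint and independent treatment of the two loop-end nucleotides can be illustrated with a minimal sketch. It is not the authors' estimation procedure: the function names, the flat pseudocount standing in for a real prior, and the toy observations are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product

NUCS = "ACGU"


def joint_loop_end_probs(pairs, pseudocount=1.0):
    """Estimate P(first, last) over all 16 loop-end nucleotide pairs.

    `pairs` is a list of (first_loop_nt, last_loop_nt) tuples. The flat
    pseudocount is an assumed stand-in for a properly estimated prior.
    """
    counts = Counter(pairs)
    total = len(pairs) + pseudocount * 16
    return {
        (a, b): (counts[(a, b)] + pseudocount) / total
        for a, b in product(NUCS, repeat=2)
    }


def independent_loop_end_probs(pairs, pseudocount=1.0):
    """Model each loop end on its own and multiply the two marginals."""
    first = Counter(a for a, _ in pairs)
    last = Counter(b for _, b in pairs)
    total = len(pairs) + pseudocount * 4
    return {
        (a, b): ((first[a] + pseudocount) / total)
        * ((last[b] + pseudocount) / total)
        for a, b in product(NUCS, repeat=2)
    }


# Hypothetical toy observations, not data from the paper.
observed = [("G", "A")] * 5 + [("U", "G")] * 5

joint = joint_loop_end_probs(observed)
indep = independent_loop_end_probs(observed)
```

In this toy data the combination G...G never occurs, yet the independent model still gives it a high probability because G is common at each end separately; the joint table keeps it low. That dependency between the two positions is the kind of signal a joint treatment can retain.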
Similar resources
Thermodynamic matchers for the construction of the cuckoo RNA family
RNA family models describe classes of functionally related, non-coding RNAs based on sequence and structure conservation. The most important method for modeling RNA families is the use of covariance models, which are stochastic models that serve in the discovery of yet unknown, homologous RNAs. However, the performance of covariance models in finding remote homologs is poor for RNA families wit...
Efficient non-coding RNA gene searches through classical and evolutionary methods
Successful non-coding RNA gene searching requires examination of long-range intramolecular base pairing possibilities. This results in search algorithms with extremely long run times such that large-scale use of the algorithms often becomes computationally infeasible. Methods for the efficient search of the solution space are examined. A review of the standard dynamic-programming covariance mod...
Acceleration of Covariance Models for Non-coding RNA Search
Stochastic context-free grammar (SCFG) based models for non-coding RNA (ncRNA) gene searches are much more powerful than regular grammar based models due to the ability to model intermolecular base pairing. The SCFG models (also known as covariance models) can be scored exactly using dynamic programming techniques. However, the computational resources needed to compute optimal scores using dyna...
The Role of Long Non Coding RNAs in the Repair of DNA Double Strand Breaks
DNA double strand breaks (DSBs) are abrasions caused in both strands of the DNA duplex following exposure to both exogenous and endogenous conditions. Such abrasions have deleterious effects in cells, leading to genome rearrangements and cell death. A number of repair systems including homologous recombination (HR) and non-homologous end-joining (NHEJ) have evolved to minimize the fatal effe...
CMfinder - a covariance model based RNA motif finding algorithm
MOTIVATION: The recent discoveries of large numbers of non-coding RNAs and computational advances in genome-scale RNA search create a need for tools for automatic, high quality identification and characterization of conserved RNA motifs that can be readily used for database search. Previous tools fall short of this goal. RESULTS: CMfinder is a new tool to predict RNA motifs in unaligned sequenc...